Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 731 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 74.4 KiB |
| Average record size in memory | 104.2 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 5 |
mnth is highly correlated with season and 3 other fields | High correlation |
season is highly correlated with mnth and 3 other fields | High correlation |
weathersit is highly correlated with hum | High correlation |
temp is highly correlated with mnth and 3 other fields | High correlation |
atemp is highly correlated with mnth and 3 other fields | High correlation |
hum is highly correlated with weathersit and 1 other fields | High correlation |
rentals is highly correlated with mnth and 4 other fields | High correlation |
workingday is highly correlated with weekday and 1 other fields | High correlation |
weekday is highly correlated with workingday | High correlation |
windspeed is highly correlated with hum | High correlation |
weekday has 105 (14.4%) zeros | Zeros |
Reproduction
| Analysis started | 2022-09-20 11:51:22.165614 |
|---|---|
| Analysis finished | 2022-09-20 11:52:20.526444 |
| Duration | 58.36 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.73871409 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.809948796 |
|---|---|
| Coefficient of variation (CV) | 0.5597629353 |
| Kurtosis | -1.194863701 |
| Mean | 15.73871409 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.006007875846 |
| Sum | 11505 |
| Variance | 77.6151978 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=31)
| Value | Count | Frequency (%) |
| 1 | 24 | 3.3% |
| 2 | 24 | 3.3% |
| 28 | 24 | 3.3% |
| 27 | 24 | 3.3% |
| 26 | 24 | 3.3% |
| 25 | 24 | 3.3% |
| 24 | 24 | 3.3% |
| 23 | 24 | 3.3% |
| 22 | 24 | 3.3% |
| 21 | 24 | 3.3% |
| Other values (21) | 491 |
| Value | Count | Frequency (%) |
| 1 | 24 | |
| 2 | 24 | |
| 3 | 24 | |
| 4 | 24 | |
| 5 | 24 | |
| 6 | 24 | |
| 7 | 24 | |
| 8 | 24 | |
| 9 | 24 | |
| 10 | 24 |
| Value | Count | Frequency (%) |
| 31 | 14 | |
| 30 | 22 | |
| 29 | 23 | |
| 28 | 24 | |
| 27 | 24 | |
| 26 | 24 | |
| 25 | 24 | |
| 24 | 24 | |
| 23 | 24 | |
| 22 | 24 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.519835841 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.451912787 |
|---|---|
| Coefficient of variation (CV) | 0.5294478069 |
| Kurtosis | -1.20911201 |
| Mean | 6.519835841 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.008148650127 |
| Sum | 4766 |
| Variance | 11.91570189 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) |
| 1 | 62 | |
| 3 | 62 | |
| 5 | 62 | |
| 7 | 62 | |
| 8 | 62 | |
| 10 | 62 | |
| 12 | 62 | |
| 4 | 60 | |
| 6 | 60 | |
| 9 | 60 | |
| Other values (2) | 117 |
| Value | Count | Frequency (%) |
| 1 | 62 | |
| 2 | 57 | |
| 3 | 62 | |
| 4 | 60 | |
| 5 | 62 | |
| 6 | 60 | |
| 7 | 62 | |
| 8 | 62 | |
| 9 | 60 | |
| 10 | 62 |
| Value | Count | Frequency (%) |
| 12 | 62 | |
| 11 | 60 | |
| 10 | 62 | |
| 9 | 60 | |
| 8 | 62 | |
| 7 | 62 | |
| 6 | 60 | |
| 5 | 62 | |
| 4 | 60 | |
| 3 | 62 |
year
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 KiB |
| 2012 | |
|---|---|
| 2011 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2924 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2011 |
|---|---|
| 2nd row | 2011 |
| 3rd row | 2011 |
| 4th row | 2011 |
| 5th row | 2011 |
Common Values
| Value | Count | Frequency (%) |
| 2012 | 366 | |
| 2011 | 365 |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2012 | 366 | |
| 2011 | 365 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1097 | |
| 1 | 1096 | |
| 0 | 731 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2924 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1097 | |
| 1 | 1096 | |
| 0 | 731 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2924 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1097 | |
| 1 | 1096 | |
| 0 | 731 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2924 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1097 | |
| 1 | 1096 | |
| 0 | 731 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 KiB |
| 3 | |
|---|---|
| 2 | |
| 1 | |
| 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 731 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 188 | |
| 2 | 184 | |
| 1 | 181 | |
| 4 | 178 |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 3 | 188 | |
| 2 | 184 | |
| 1 | 181 | |
| 4 | 178 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 188 | |
| 2 | 184 | |
| 1 | 181 | |
| 4 | 178 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 731 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 188 | |
| 2 | 184 | |
| 1 | 181 | |
| 4 | 178 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 731 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 188 | |
| 2 | 184 | |
| 1 | 181 | |
| 4 | 178 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 731 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 188 | |
| 2 | 184 | |
| 1 | 181 | |
| 4 | 178 |
holiday
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 KiB |
| 0 | |
|---|---|
| 1 | 21 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 731 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 710 | |
| 1 | 21 | 2.9% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 710 | |
| 1 | 21 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 710 | |
| 1 | 21 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 731 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 710 | |
| 1 | 21 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 731 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 710 | |
| 1 | 21 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 731 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 710 | |
| 1 | 21 | 2.9% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.997264022 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 105 |
| Zeros (%) | 14.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.004786918 |
|---|---|
| Coefficient of variation (CV) | 0.6688723127 |
| Kurtosis | -1.254282352 |
| Mean | 2.997264022 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.002741597663 |
| Sum | 2191 |
| Variance | 4.019170586 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 6 | 105 | |
| 0 | 105 | |
| 1 | 105 | |
| 2 | 104 | |
| 3 | 104 | |
| 4 | 104 | |
| 5 | 104 |
| Value | Count | Frequency (%) |
| 0 | 105 | |
| 1 | 105 | |
| 2 | 104 | |
| 3 | 104 | |
| 4 | 104 | |
| 5 | 104 | |
| 6 | 105 |
| Value | Count | Frequency (%) |
| 6 | 105 | |
| 5 | 104 | |
| 4 | 104 | |
| 3 | 104 | |
| 2 | 104 | |
| 1 | 105 | |
| 0 | 105 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 731 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 500 | |
| 0 | 231 |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 500 | |
| 0 | 231 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 500 | |
| 0 | 231 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 731 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 500 | |
| 0 | 231 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 731 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 500 | |
| 0 | 231 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 731 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 500 | |
| 0 | 231 |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 KiB |
| 1 | |
|---|---|
| 2 | |
| 3 | 21 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 731 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 463 | |
| 2 | 247 | |
| 3 | 21 | 2.9% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 463 | |
| 2 | 247 | |
| 3 | 21 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 463 | |
| 2 | 247 | |
| 3 | 21 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 731 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 463 | |
| 2 | 247 | |
| 3 | 21 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 731 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 463 | |
| 2 | 247 | |
| 3 | 21 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 731 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 463 | |
| 2 | 247 | |
| 3 | 21 | 2.9% |
| Distinct | 499 |
|---|---|
| Distinct (%) | 68.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4953847885 |
| Minimum | 0.0591304 |
|---|---|
| Maximum | 0.861667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 KiB |
Quantile statistics
| Minimum | 0.0591304 |
|---|---|
| 5-th percentile | 0.2135685 |
| Q1 | 0.3370835 |
| median | 0.498333 |
| Q3 | 0.6554165 |
| 95-th percentile | 0.76875 |
| Maximum | 0.861667 |
| Range | 0.8025366 |
| Interquartile range (IQR) | 0.318333 |
Descriptive statistics
| Standard deviation | 0.1830509961 |
|---|---|
| Coefficient of variation (CV) | 0.3695127512 |
| Kurtosis | -1.118864155 |
| Mean | 0.4953847885 |
| Median Absolute Deviation (MAD) | 0.158333 |
| Skewness | -0.05452096476 |
| Sum | 362.1262804 |
| Variance | 0.03350766718 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.635 | 5 | 0.7% |
| 0.265833 | 5 | 0.7% |
| 0.68 | 4 | 0.5% |
| 0.710833 | 4 | 0.5% |
| 0.564167 | 4 | 0.5% |
| 0.484167 | 4 | 0.5% |
| 0.649167 | 4 | 0.5% |
| 0.696667 | 4 | 0.5% |
| 0.4375 | 4 | 0.5% |
| 0.606667 | 3 | 0.4% |
| Other values (489) | 690 |
| Value | Count | Frequency (%) |
| 0.0591304 | 1 | |
| 0.0965217 | 1 | |
| 0.0973913 | 1 | |
| 0.1075 | 1 | |
| 0.1275 | 1 | |
| 0.134783 | 1 | |
| 0.138333 | 1 | |
| 0.144348 | 1 | |
| 0.15 | 1 | |
| 0.150833 | 1 |
| Value | Count | Frequency (%) |
| 0.861667 | 1 | |
| 0.849167 | 1 | |
| 0.848333 | 1 | |
| 0.838333 | 1 | |
| 0.834167 | 1 | |
| 0.83 | 1 | |
| 0.828333 | 1 | |
| 0.8275 | 1 | |
| 0.8225 | 1 | |
| 0.818333 | 1 |
| Distinct | 690 |
|---|---|
| Distinct (%) | 94.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4743539886 |
| Minimum | 0.0790696 |
|---|---|
| Maximum | 0.840896 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 KiB |
Quantile statistics
| Minimum | 0.0790696 |
|---|---|
| 5-th percentile | 0.2206455 |
| Q1 | 0.3378425 |
| median | 0.486733 |
| Q3 | 0.608602 |
| 95-th percentile | 0.714967 |
| Maximum | 0.840896 |
| Range | 0.7618264 |
| Interquartile range (IQR) | 0.2707595 |
Descriptive statistics
| Standard deviation | 0.1629611784 |
|---|---|
| Coefficient of variation (CV) | 0.3435433922 |
| Kurtosis | -0.9851305305 |
| Mean | 0.4743539886 |
| Median Absolute Deviation (MAD) | 0.135624 |
| Skewness | -0.1310880421 |
| Sum | 346.7527657 |
| Variance | 0.02655634566 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.654688 | 4 | 0.5% |
| 0.375621 | 3 | 0.4% |
| 0.637008 | 3 | 0.4% |
| 0.571975 | 2 | 0.3% |
| 0.466525 | 2 | 0.3% |
| 0.607962 | 2 | 0.3% |
| 0.654042 | 2 | 0.3% |
| 0.32575 | 2 | 0.3% |
| 0.595346 | 2 | 0.3% |
| 0.39835 | 2 | 0.3% |
| Other values (680) | 707 |
| Value | Count | Frequency (%) |
| 0.0790696 | 1 | |
| 0.0988391 | 1 | |
| 0.101658 | 1 | |
| 0.116175 | 1 | |
| 0.11793 | 1 | |
| 0.119337 | 1 | |
| 0.126275 | 1 | |
| 0.144283 | 1 | |
| 0.149548 | 1 | |
| 0.150883 | 1 |
| Value | Count | Frequency (%) |
| 0.840896 | 1 | |
| 0.826371 | 1 | |
| 0.804913 | 1 | |
| 0.804287 | 1 | |
| 0.794829 | 1 | |
| 0.790396 | 1 | |
| 0.786613 | 1 | |
| 0.785967 | 1 | |
| 0.761367 | 1 | |
| 0.757579 | 1 |
| Distinct | 595 |
|---|---|
| Distinct (%) | 81.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6278940629 |
| Minimum | 0 |
|---|---|
| Maximum | 0.9725 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.4074545 |
| Q1 | 0.52 |
| median | 0.626667 |
| Q3 | 0.7302085 |
| 95-th percentile | 0.8685415 |
| Maximum | 0.9725 |
| Range | 0.9725 |
| Interquartile range (IQR) | 0.2102085 |
Descriptive statistics
| Standard deviation | 0.1424290951 |
|---|---|
| Coefficient of variation (CV) | 0.2268361871 |
| Kurtosis | -0.06453013469 |
| Mean | 0.6278940629 |
| Median Absolute Deviation (MAD) | 0.104584 |
| Skewness | -0.06978343399 |
| Sum | 458.99056 |
| Variance | 0.02028604714 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.613333 | 4 | 0.5% |
| 0.605 | 3 | 0.4% |
| 0.59 | 3 | 0.4% |
| 0.538333 | 3 | 0.4% |
| 0.69 | 3 | 0.4% |
| 0.57 | 3 | 0.4% |
| 0.568333 | 3 | 0.4% |
| 0.722917 | 3 | 0.4% |
| 0.552083 | 3 | 0.4% |
| 0.74125 | 3 | 0.4% |
| Other values (585) | 700 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.187917 | 1 | |
| 0.254167 | 1 | |
| 0.275833 | 1 | |
| 0.29 | 1 | |
| 0.302174 | 1 | |
| 0.305 | 1 | |
| 0.31125 | 1 | |
| 0.314167 | 1 | |
| 0.314348 | 1 |
| Value | Count | Frequency (%) |
| 0.9725 | 1 | |
| 0.970417 | 1 | |
| 0.9625 | 1 | |
| 0.949583 | 1 | |
| 0.948261 | 1 | |
| 0.939565 | 1 | |
| 0.93 | 1 | |
| 0.929167 | 1 | |
| 0.925 | 1 | |
| 0.9225 | 1 |
| Distinct | 650 |
|---|---|
| Distinct (%) | 88.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1904862116 |
| Minimum | 0.0223917 |
|---|---|
| Maximum | 0.507463 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 KiB |
Quantile statistics
| Minimum | 0.0223917 |
|---|---|
| 5-th percentile | 0.07961665 |
| Q1 | 0.13495 |
| median | 0.180975 |
| Q3 | 0.2332145 |
| 95-th percentile | 0.343283 |
| Maximum | 0.507463 |
| Range | 0.4850713 |
| Interquartile range (IQR) | 0.0982645 |
Descriptive statistics
| Standard deviation | 0.07749787068 |
|---|---|
| Coefficient of variation (CV) | 0.4068424167 |
| Kurtosis | 0.4109222677 |
| Mean | 0.1904862116 |
| Median Absolute Deviation (MAD) | 0.049129 |
| Skewness | 0.6773454211 |
| Sum | 139.2454207 |
| Variance | 0.00600591996 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.134954 | 3 | 0.4% |
| 0.228858 | 3 | 0.4% |
| 0.136817 | 3 | 0.4% |
| 0.1107 | 3 | 0.4% |
| 0.118792 | 3 | 0.4% |
| 0.149883 | 3 | 0.4% |
| 0.167912 | 3 | 0.4% |
| 0.166667 | 3 | 0.4% |
| 0.10635 | 3 | 0.4% |
| 0.180975 | 2 | 0.3% |
| Other values (640) | 702 |
| Value | Count | Frequency (%) |
| 0.0223917 | 1 | |
| 0.0423042 | 1 | |
| 0.0454042 | 1 | |
| 0.0454083 | 1 | |
| 0.04665 | 1 | |
| 0.047275 | 1 | |
| 0.0503792 | 1 | |
| 0.0528708 | 1 | |
| 0.053213 | 1 | |
| 0.057225 | 1 |
| Value | Count | Frequency (%) |
| 0.507463 | 1 | |
| 0.441563 | 1 | |
| 0.422275 | 1 | |
| 0.421642 | 1 | |
| 0.417908 | 1 | |
| 0.415429 | 1 | |
| 0.4148 | 1 | |
| 0.409212 | 1 | |
| 0.407346 | 1 | |
| 0.398008 | 1 |
| Distinct | 606 |
|---|---|
| Distinct (%) | 82.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 848.1764706 |
| Minimum | 2 |
|---|---|
| Maximum | 3410 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 88 |
| Q1 | 315.5 |
| median | 713 |
| Q3 | 1096 |
| 95-th percentile | 2355 |
| Maximum | 3410 |
| Range | 3408 |
| Interquartile range (IQR) | 780.5 |
Descriptive statistics
| Standard deviation | 686.6224883 |
|---|---|
| Coefficient of variation (CV) | 0.8095278661 |
| Kurtosis | 1.322074327 |
| Mean | 848.1764706 |
| Median Absolute Deviation (MAD) | 396 |
| Skewness | 1.266454032 |
| Sum | 620017 |
| Variance | 471450.4414 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 120 | 4 | 0.5% |
| 968 | 4 | 0.5% |
| 163 | 3 | 0.4% |
| 653 | 3 | 0.4% |
| 123 | 3 | 0.4% |
| 140 | 3 | 0.4% |
| 244 | 3 | 0.4% |
| 639 | 3 | 0.4% |
| 775 | 3 | 0.4% |
| 1198 | 2 | 0.3% |
| Other values (596) | 700 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 9 | 2 | |
| 15 | 1 | |
| 25 | 1 | |
| 34 | 1 | |
| 38 | 2 | |
| 41 | 1 | |
| 42 | 1 | |
| 43 | 1 | |
| 46 | 1 |
| Value | Count | Frequency (%) |
| 3410 | 1 | |
| 3283 | 1 | |
| 3252 | 1 | |
| 3160 | 1 | |
| 3155 | 1 | |
| 3065 | 1 | |
| 3031 | 1 | |
| 2963 | 1 | |
| 2855 | 1 | |
| 2846 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| day | mnth | year | season | holiday | weekday | workingday | weathersit | temp | atemp | hum | windspeed | rentals | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | 2011 | 1 | 0 | 6 | 0 | 2 | 0.344167 | 0.363625 | 0.805833 | 0.160446 | 331 |
| 1 | 2 | 1 | 2011 | 1 | 0 | 0 | 0 | 2 | 0.363478 | 0.353739 | 0.696087 | 0.248539 | 131 |
| 2 | 3 | 1 | 2011 | 1 | 0 | 1 | 1 | 1 | 0.196364 | 0.189405 | 0.437273 | 0.248309 | 120 |
| 3 | 4 | 1 | 2011 | 1 | 0 | 2 | 1 | 1 | 0.200000 | 0.212122 | 0.590435 | 0.160296 | 108 |
| 4 | 5 | 1 | 2011 | 1 | 0 | 3 | 1 | 1 | 0.226957 | 0.229270 | 0.436957 | 0.186900 | 82 |
| 5 | 6 | 1 | 2011 | 1 | 0 | 4 | 1 | 1 | 0.204348 | 0.233209 | 0.518261 | 0.089565 | 88 |
| 6 | 7 | 1 | 2011 | 1 | 0 | 5 | 1 | 2 | 0.196522 | 0.208839 | 0.498696 | 0.168726 | 148 |
| 7 | 8 | 1 | 2011 | 1 | 0 | 6 | 0 | 2 | 0.165000 | 0.162254 | 0.535833 | 0.266804 | 68 |
| 8 | 9 | 1 | 2011 | 1 | 0 | 0 | 0 | 1 | 0.138333 | 0.116175 | 0.434167 | 0.361950 | 54 |
| 9 | 10 | 1 | 2011 | 1 | 0 | 1 | 1 | 1 | 0.150833 | 0.150888 | 0.482917 | 0.223267 | 41 |
Last rows
| day | mnth | year | season | holiday | weekday | workingday | weathersit | temp | atemp | hum | windspeed | rentals | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 721 | 22 | 12 | 2012 | 1 | 0 | 6 | 0 | 1 | 0.265833 | 0.236113 | 0.441250 | 0.407346 | 205 |
| 722 | 23 | 12 | 2012 | 1 | 0 | 0 | 0 | 1 | 0.245833 | 0.259471 | 0.515417 | 0.133083 | 408 |
| 723 | 24 | 12 | 2012 | 1 | 0 | 1 | 1 | 2 | 0.231304 | 0.258900 | 0.791304 | 0.077230 | 174 |
| 724 | 25 | 12 | 2012 | 1 | 1 | 2 | 0 | 2 | 0.291304 | 0.294465 | 0.734783 | 0.168726 | 440 |
| 725 | 26 | 12 | 2012 | 1 | 0 | 3 | 1 | 3 | 0.243333 | 0.220333 | 0.823333 | 0.316546 | 9 |
| 726 | 27 | 12 | 2012 | 1 | 0 | 4 | 1 | 2 | 0.254167 | 0.226642 | 0.652917 | 0.350133 | 247 |
| 727 | 28 | 12 | 2012 | 1 | 0 | 5 | 1 | 2 | 0.253333 | 0.255046 | 0.590000 | 0.155471 | 644 |
| 728 | 29 | 12 | 2012 | 1 | 0 | 6 | 0 | 2 | 0.253333 | 0.242400 | 0.752917 | 0.124383 | 159 |
| 729 | 30 | 12 | 2012 | 1 | 0 | 0 | 0 | 1 | 0.255833 | 0.231700 | 0.483333 | 0.350754 | 364 |
| 730 | 31 | 12 | 2012 | 1 | 0 | 1 | 1 | 2 | 0.215833 | 0.223487 | 0.577500 | 0.154846 | 439 |